Reverse spatial top-k keyword queries
Abstract
We introduce the Reverse Spatial Top-k Keyword (RSK) query, which is defined as: given a query term q, an integer k and a neighborhood size, find all neighborhoods of that size where q is in the top-k most frequent terms among the social posts in those neighborhoods. An obvious approach would be to partition the dataset with a uniform grid structure of a given cell size and identify the cells where this term is in the top-k most frequent keywords. However, this answer would be incomplete since it only checks for neighborhoods that are perfectly aligned with the grid. Furthermore, for every neighborhood (square) that is an answer, we can define infinitely more result neighborhoods by minimally shifting the square without including more posts in it. To address that, we need to identify contiguous regions where any point in the region can be the center of a neighborhood that satisfies the query. We propose an algorithm to efficiently answer an RSK query using an index structure consisting of a uniform grid augmented by materialized lists of term frequencies. We apply various optimizations that drastically improve query latency against baseline approaches. We also provide a theoretical model to choose the optimal cell size for the index to minimize query latency. We further examine a restricted version of the problem (RSKR) that limits the scope of the answer and propose efficient approximate algorithms. Finally, we examine how parallelism can improve performance by balancing the workload using a smart load slicing technique. Extensive experimental evaluation of the proposed methods on real Twitter datasets and crime report datasets shows the efficiency of our optimizations and the accuracy of the proposed theoretical model.
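The "obvious approach" mentioned in the abstract (a uniform grid, with each cell checked for whether the query term is among its top-k terms) can be illustrated with a minimal Python sketch. This is only the naive baseline that the abstract calls incomplete, not the authors' RSK algorithm or index. The function name `grid_baseline`, the `(x, y, terms)` post layout, and the tie-breaking rule (ties count in favor of q) are assumptions made for illustration.

```python
from collections import Counter, defaultdict


def grid_baseline(posts, q, k, cell_size):
    """Naive grid baseline (hypothetical helper, not the paper's method).

    posts: iterable of (x, y, terms), where terms is a list of keywords.
    Returns the sorted grid cells (ix, iy) in which q is among the k most
    frequent terms. Only grid-aligned neighborhoods are examined.
    """
    # Accumulate term frequencies per grid cell.
    cell_counts = defaultdict(Counter)
    for x, y, terms in posts:
        cell = (int(x // cell_size), int(y // cell_size))
        cell_counts[cell].update(terms)

    result = []
    for cell, counts in cell_counts.items():
        fq = counts.get(q, 0)
        if fq == 0:
            continue  # q never occurs in this cell
        # Assumption: q is in the top-k if fewer than k terms are
        # strictly more frequent than q (ties count in favor of q).
        more_frequent = sum(1 for f in counts.values() if f > fq)
        if more_frequent < k:
            result.append(cell)
    return sorted(result)
```

Because the sketch inspects only cells aligned with the grid, it misses every qualifying neighborhood that straddles cell boundaries, which is the incompleteness the abstract points out.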
Similar resources
Efficient Processing of Top-k Spatial Keyword Queries
Given a spatial location and a set of keywords, a top-k spatial keyword query returns the k best spatio-textual objects ranked according to their proximity to the query location and relevance to the query keywords. There are many applications handling huge amounts of geotagged data, such as Twitter and Flickr, that can benefit from this query. Unfortunately, the state-of-the-art approaches requ...
Efficient Reverse Top-k Boolean Spatial Keyword Queries on Road Networks
Reverse k nearest neighbor (RkNN) queries have a broad application base such as decision support, profile-based marketing, and resource allocation. Previous work on RkNN search either does not take textual information into consideration or is limited to Euclidean space. In the real world, however, most spatial objects are associated with textual information and lie on road networks. In this paper, we ...
Indexing Reverse Top-k Queries
We consider the recently introduced monochromatic reverse top-k queries, which ask for, given a new tuple q and a dataset D, all possible top-k queries on D ∪ {q} for which q is in the result. Towards this problem, we focus on designing indexes in two dimensions for repeated (or batch) querying, a novel but practical consideration. We present the novel insight that by representing the dataset a...
Reverse Engineering Top-k Join Queries
Ranked lists have become a fundamental tool to represent the most important items taken from a large collection of data. Search engines, sports leagues and e-commerce platforms present their results, most successful teams and most popular items in a concise and structured way by making use of ranked lists. This paper introduces the PALEO-J framework which is able to reconstruct top-k database q...
Continuous Monitoring of Top-k Spatial Keyword Queries in Road Networks
Recently, spatial keyword queries (SKQ) have become a hot topic in the database field. However, most of the existing SKQ methods are limited to Euclidean space or assume that objects (and queries) are static. This paper addresses the issue of processing continuous top-k spatial keyword queries over moving objects (CMTkSK) in road networks. To efficiently index moving geo-textual objects in road net...
Journal
Journal title: The VLDB Journal
Year: 2022
ISSN: ['0949-877X', '1066-8888']
DOI: https://doi.org/10.1007/s00778-022-00759-9